Evaluation of Tectogrammatical Annotation of PDT
نویسندگان
چکیده
Two phases of an evaluation of annotating a Czech text corpus on an underlying syntactic level are described and the results are compared and
منابع مشابه
The Meaning of the Conditional Mood Within the Tectogrammatical Annotation of Prague Dependency Treebank 2.0
The conditional form is one of the moods of Czech verbs, and it renders several meanings in contemporary Czech texts (Sect. 2). The present paper focuses on the primary function of this mood, which is to express hypothetical events (Sect. 3). In Section 4, we briefly mention how modality has been treated up to now in PDT 2.0 and some other treebanks and finally in Section 5 we propose a new way...
متن کاملAutomatic Procedures in Tectogrammatical Tagging
A semi-automatic syntactic annotation of a part of the Czech National Corpus in the Prague Dependency Treebank (PDT) has among its aims the possibility to check the theoretical approach chosen (Functional Generative Description, see [2]). While the first phases of the annotation of PDT, i.e. the morphemic representations and the dependency trees on an intermediate analytic level, i.e. analytic ...
متن کاملAnnotation Tool for Discourse in PDT
We present a tool for annotation of se mantic intersentential discourse rela tions on the tectogrammatical layer of the Prague Dependency Treebank (PDT). We present the way of helping the annotators by several useful features implemented in the annotation tool, such as a possibility to combine surface and deep syntactic representation of sen tences during the annotation, a possibili ty to ...
متن کاملPDT: Two Steps in Tectogrammatical Annotation with respect to some Issues of Deletion
The annotation of the Prague Dependency Treebank is realized in two sub-collections which differ in the subtlety of annotation (the large collection and the model collection). In the present paper, we focus on deletions of complementations of verbs, postverbal nouns and adjectives, from the point of view of the annotators of the model collection. We inquire into the issues of deletions of parti...
متن کاملSynthesis of Czech Sentences from Tectogrammatical Trees
In this paper we deal with a new rule-based approach to the Natural Language Generation problem. The presented system synthesizes Czech sentences from Czech tectogrammatical trees supplied by the Prague Dependency Treebank 2.0 (PDT 2.0). Linguistically relevant phenomena including valency, diathesis, condensation, agreement, word order, punctuation and vocalization have been studied and impleme...
متن کامل